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Abstract 

Many e-commerce websites use recommender systems 
to recommend items to users. When a user or item 
is new, the system may fail because not enough in¬ 
formation is available on this user or item. Vari¬ 
ous solutions to this ‘cold-start problem’ have been 
proposed in the literature. However, many real-life 
e-commerce applications suffer from an aggravated, 
recurring version of cold-start even for known users 
or items, since many users visit the website rarely, 
change their interests over time, or exhibit differ¬ 
ent personas. This paper exposes the Continuous 
Cold Start (CoCoS) problem and its consequences 
for content- and context-based recommendation from 
the viewpoint of typical e-commerce applications, il¬ 
lustrated with examples from a major travel recom¬ 
mendation website, Booking.com. 

Terms: CoCoS: continuous cold start 
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1 Introduction 

Many e-commerce websites are built around serv¬ 
ing personalized recommendations to users. Ama¬ 
zon.com recommends books, Booking.com recom¬ 
mends accommodations, Netflix recommends movies, 


Reddit recommends news stories, etc. Two examples 
of recommendations of accomodations and destina¬ 
tions at Booking.com are shown in Figure 1. This 
widescale adoption of recommender systems online, 
and the challenges faced by industrial applications, 
have been a driving force in the development of rec¬ 
ommender systems. The research area has been ex¬ 
panding since the first papers on collaborative filter¬ 
ing in the 1990s [12, 16]. Many different recommen¬ 
dation approaches have been developed since then, in 
particular content-based and hybrid approaches have 
supplemented the original collaborative filtering tech¬ 
niques [1]. 

In the most basic formulation, the task of a rec¬ 
ommender system is to predict ratings for items that 
have not been seen by the user. Using these pre¬ 
dicted ratings, the system decides which new items 
to recommend to the user. Recommender systems 
base the prediction of unknown ratings on past or 
current information about the users and items, such 
as past user ratings, user profiles, item descriptions 
etc. If this information is not available for new users 
or items, the recommender system runs into the so- 
called cold-start problem: It does not know what to 
recommend until the new, ‘cold’, user or item has 
‘warmed-up’, i.e. until enough information has been 
generated to produce recommendations. For exam¬ 
ple, which accomodations should be recommended to 
someone who visits Booking.com for the first time? 
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Customers who viewed Hotel Sacher Wien also viewed: 


Destinations related to Vienna: 


Grand Hotel Wien 

***** 


Only a 3-minute walk 
from the Vienna State 
j Opera and the famous 
I Karntner StraBe 
shopping street, the 5- 
star Grand Hotel Wien offers you 
luxurious rooms and 2 gourmet 
restaurants, as well as a large... 


Hotel de France Wien 

★★★★★ 


Le Meridien Vienna 

***** 


| Centrally located on 
I Vienna's elegant 
RingstraBe boulevard, 

I Hotel de France Wien 
offers a restaurant, a bar 
with piano music, free 24-hour gym 
access, and free WiFi in all 
areas.The Schottentor... 


Offering free access to 
its luxurious spa, indoor 
pool, and 24-hour 
fitness centre, free WiFi 
access, and free minibar 
use with a daily refill, Le Meridien 
Vienna is located right at the 



Prague ^ 

1037 properties available 


Figure 1: Examples of recommender systems on Booking.com. User-to-user collaborative filtering (left): 
recommend accomodations viewed by similar users to a user who just looked at ‘Hotel Sacher Wien’. Item- 
to-item content-based recommendations (right): recommend destinations similar to a particular destination, 
Vienna. 


If the recommender system is based on which acco¬ 
modations users have clicked on in the past, the first 
recommendations can only be made after the user has 
clicked on a couple of accomodations on the website. 

Several approaches have been proposed and suc¬ 
cessfully applied to deal with the cold-start problem, 
such as utilizing baselines for cold users [8], combin¬ 
ing collaborative filtering with content-based recom- 
menders in hybrid systems [14], eliciting ratings from 
new users [11], or, more recently, exploiting the so¬ 
cial network of users [6, 15]. In particular, content- 
based approaches have been very successful in deal¬ 
ing with cold-start problems in collaborative filtering 
[3, 4, 13, 14]. 

These approaches deal explicitly with cold users or 
items, and provide a ‘fix’ until enough information 
has been gathered to apply the core recommender 
system. Thus, rather than providing unified recom¬ 
mendations for cold and warm users, they temporar¬ 
ily bridge the period during which the user or item is 
‘cold’ until it is ‘warm’. This can be very successful 
in situations in which there are no warm users [3] , or 
in situations when the warm-up period is short and 
warmed-up users or items stay warm. 

However, in many practical e-commerce applica¬ 
tions, users or items remain cold for a long time, and 
can even ‘cool down’ again, leading to a continuous 
cold-start (CoCoS). In the example of Booking.com, 
many users visit and book infrequently since they go 
on holiday only once or twice a year, leading to a pro¬ 
longed cold-start and extreme sparsity of collabora¬ 
tive filtering matrices, see Fig. 2 (top). In addition, 


even warm long-term users can cool down as they 
change their needs over time, e.g. going from booking 
youth hostels for road trips to resorts for family vaca¬ 
tions. Such cool-downs can happen more frequently 
and rapidly for users who book accommodations for 
different travel purposes, e.g. for leisure holidays and 
business trips as shown in Fig. 2 (bottom). These 
continuous cold-start problems are rarely addressed 
in the literature despite their relevance in industrial 
applications. Classical approaches to the cold-start 
problem fail in the case of CoCoS, since they assume 
that users warm up in a reasonable time and stay 
warm after that. 

In the remainder of the paper, we will elabo¬ 
rate on how CoCoS appears in e-commerce websites 
(Sec. 2), outline some approaches to the CoCoS prob¬ 
lem (Sec. 3), and end with a discussion about possible 
future directions (Sec. 4). 

2 Continuous Cold-Start 

Cold-start problems can in principle arise on both the 
user side and the items side. 

2.1 User Continuous Cold-Start 

We first focus on the user side of CoCoS, which can 
arise in the following cases: 

Classical cold-start / sparsity: new or rare users 
Volatility: user interest changes over time 
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Personas: user has different interests at different, 
possibly close-by points in time 

Identity: failure to match data from the same user 

All cases arise commonly in e-commerce websites. 
New users arrive frequently (classical cold-start), or 
may appear new when they don’t log in or use a dif¬ 
ferent device (failed identity match). Some websites 
are prone to very low levels of user activity when 
items are purchased only rarely, such as travel, cars 
etc., leading to sparsity problems for recommender 
systems. Most users change their interests over time 
(volatility), e.g. movie preferences evolve, or travel 
needs change. On even shorter timescales, users have 
different personas. Depending on their mood or their 
social context, they might be interested in watching 
different movies. Depending on the weather or their 
travel purpose, they may want to book different types 
of trips, see Figure 2 for examples from Booking.com. 

These issues arise for collaborative filtering as well 
as content-based or hybrid approaches, since both 
user ratings or activities as well user profiles might 
be missing, become outdated over time, or not be 
relevant to the current user persona. 

2.2 Item Continuous Cold-Start 

In a symmetric way, these CoCoS problems also arise 
for items: 

Classical cold-start / sparsity: new or rare items 

Volatility: item properties or value changes over 
time 

Personas: item appeals to different types of users 

Identity: failure to match data from the same item 

New items appear frequently in e-commerce cata¬ 
logues, as shown in Figure 3 for accommodations 
at Booking.com. Some items are interesting only 
to niche audiences, or sold only rarely, for example 
books or movies on specialized topics. Items can be 
volatile if their properties change over time, such as s 
phone that becomes outdated once a newer model is 
released, or a hotel that undergoes a renovation. In 


the context of news or conversions, item volatility is 
also known as topic drift [9]. Figure 3 on the right 
shows fluctuations of the review score of a hotel at 
Booking.com. Some items have different ‘personas’ 
in that they target several user groups, such as a ho¬ 
tel that caters to business as well as leisure travellers. 
When several sellers can add items to an e-commerce 
catalogue, or when several catalogues are combined, 
correctly matching items can be problematic (iden¬ 
tity problem). 


3 Addressing Cold-Start 

Many approaches have been proposed to deal with 
the classical cold-start problem of new or rare users 
or items [11]. However, they mostly fail to address 
the more difficult CoCoS. 

The most popular strategy to address the clas¬ 
sic cold-start problem is the hybrid approach where 
collaborative filtering and content-based models are 
combined, see [14] as an example. If one of the two 
method fails due to a new user or item, the other 
method is used to ‘fill-in’. The most basic assump¬ 
tion is that similar users will like similar items. Sim¬ 
ilarity of users is measured by their purchase history 
when warm, and by their user profile when cold. Con¬ 
versely, similarities between items is computed by 
the set of users that purchased them when warm, 
and by their content when cold. In CoCoS, users 
change their interests, so both collaborative filtering 
and user-profile-based approaches can fail, since look¬ 
ing at the past and similarities can be misleading. 
Items also suffer from volatility, although to a lesser 
degree, which makes the standard hybrid approach 
also problematic for items. Hybrid approaches also 
ignore the issue of multiple personas. 

Although, to our knowledge, the continuous cold- 
start problem as defined in this work has not been di¬ 
rectly addressed in the literature, several approaches 
are promising. 

Tang et al. [19] propose a context-aware recom¬ 
mender system, implemented as a contextual multi¬ 
armed bandits problem. Although the authors report 
extensive offline evaluation (log based and simulation 
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Figure 2: Continuously cold users at Booking.com. Activity levels of two randomly chosen users of Book¬ 
ing.com over time. The top user exhibits only rare activity throughout a year, and the bottom user has two 
different personas, making a leisure and a business booking, without much activity inbetween. 
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Figure 3: Continuously cold items at Booking.com. Thousands of new accommodations are added to Book¬ 
ing.com every month (left). The user ratings of a hotel can change continuously (right). 


based) with acceptable CTR, no comparison is made 
from a cold-start problem standpoint. 

Sun et al. [18] explicitly attack the user volatility 
problem. They propose a dynamic extension of ma¬ 
trix factorization where the user latent space is mod¬ 
eled by a state space model fitted by a Kalman fil¬ 
ter. Generative data presenting user preference tran¬ 


sitions is used for evaluation. Improvements of RMSE 
when compared to timeSVD [10] are reported. Con¬ 
sistent results are reported in [5], after offline evalu¬ 
ation using real data. 
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Tavakol and Brefeld [20] propose a topic driven 
recommender system. At the user session level, the 
user intent is modeled as a topic distribution over all 
















the possible item attributes. As the user interacts 
with the system, the user intent is predicted and rec¬ 
ommendations are computed using the corresponding 
topic distribution. The topic prediction is solved by 
factored Markov decision processes. Evaluation on 
an e-commerce data set shows improvements when 
compared to collaborative filtering methods in terms 
of average rank. 

4 Discussion 

In this manuscript, we have described how CoCoS, 
the continuous cold-start problem, is a common is¬ 
sue for e-commerce applications. Industrial recom- 
mender systems do not only have to deal with ‘cold’ 
(new or rare) users and items, but also with known 
users or items that repeatedly ‘cool down’. Reasons 
for the recurring cool-downs include the volatility in 
user interests or item values, different personas de¬ 
pending on user context or item target audience, or 
identification problems due to logged-out users or 
items from different catalogues. Despite the practical 
relevance of CoCoS, common literature approaches 
do not deal well with this issue. 

We consider several directions as particularly 
promising to deal with CoCoS. Traditional ap¬ 
proaches to solve cold-start problems try to employ 
collaborative filtering based on pseudo or inferred 
clicks. Recommendations based on social networks 
are an interesting new development that can supple¬ 
ment missing information based on the social graph. 
For example, recommendations based on Facebook 
likes are proposed in [15]. Beyond the difficulty to get 
access to social data, the application to user volatil¬ 
ity or multiple personas remains challenging. Online 
user intent prediction can be used to estimate a user’s 
current profile on the fly. When a user visits the web¬ 
site, his browsing behavior is used to estimate his 
intent after a few clicks, which are then used to com¬ 
pute recommendations accordingly. However, this 
still delays recommendations until enough clicks have 
occurred, which can be problematic if quick recom¬ 
mendations are needed. For example, in last-minute 
bookings, users may be pressed to book an accom¬ 
modation quickly, leading to very short sessions. 


More promising approaches employ content based 
or contextual recommendation. Content based rec¬ 
ommendations can be very effective based on very 
little signal: just an initial query or single interaction 
can be exploited to find an initial item or set of items 
and exploit relations between items to make effective 
recommendations. In particular context aware rec¬ 
ommendations are one of the most promising strate¬ 
gies when it comes to solving CoCoS. In this setup, 
recommendations are computed based on the current 
context of the current visitor and the behaviour of 
other users in similar contexts [see 2, 7, 17] for ex¬ 
amples. Context is defined as a set of features such 
as location, time, weather, device, etc. Often this 
data is readily available in most commercial imple¬ 
mentations of recommender systems. This approach 
naturally addresses sparsity by clustering users into 
contexts. Since context is determined in a per-act ion 
basis, user volatility and multiple personas can be ad¬ 
dressed robustly. On the other hand, context aware 
recommenders cannot address the item side of the 
problem and they might also suffer from cold-start 
problems in the case of a cold context that has never 
seen before by the system. 
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